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NOVEL USE 



FIELD OF INVENTION 

This invention relates to new uses for polynucleotides and polypeptides encoded by 
them and to their production. More particularly, the polynucleotides and polypeptides of the 
present invention relate to the clock gene family, hereinafter referred to as HSCLOCK. The 
invention also relates to inhibiting or activating the action of such polynucleotides and 
polypeptides. 

BACKGROUND OF THE INVENTION 

Circadian rhythmicity represents a complex behavioural and physiological phenotype. 
Clock genes have been identified in non-mammalian organisms, most notably, the period (per) 
and timeless (tim) genes in Drosophila and the frequency (frq) gene in Neurospora (reviewed in 
J.C. Hall, Trends Neurosci., 18: 230-240, 1995; J. C. Dunlap, Ann.Rev.Genet., 30: 579-601, 

1996) . A mouse clock gene has recently been identified by positional cloning (D. P. King et al., 
Cell, 89: 641-653, 1997). Mutation and transgenic studies (M. P. Antoch et al, Cell, 89: 655- 
667, 1997) confirm the involvement of the clock gene in circadian rhythmicity. The mouse clock 
protein contains both a DNA binding domain and a protein dimerisation domain indicating that it 
could, in combination with other proteins, regulate circadian rhythmicity by regulating gene 
transcription. The pattern of mouse clock gene expresssion is consistent with its role in circadian 
rhythms with highest levels of expression in the hypothalamus and eye, both of which are known 
to contain self-sustaining circadian oscillators. The mouse clock gene is also expressed in many 
tissues throughout the body. Similarly the drosophila gene has a wide tissue distribution pattern. 

Recently Nagase,T. et al (DNA Res. 4, 141-150, 1997) published a set of full-length, 
but unidentified cDNAs expressed in human brain. The present invention identifies one of 
these cDNA sequences as encoding a human clock gene. The gene from which this human 
clock cDNA has been derived has been mapped to chromosome 4 (Nagase,T. et al (DNA Res. 
4, 141-150, 1997). It has been predicted independently that the human clock gene maps to 
4ql2-4ql3 by synteny with the mouse clock locus (King, DP et al Genetics 146, 1049-1060, 

1997) . 

SUMMARY OF THE INVENTION 

In one aspect, the invention relates to the use of HSCLOCK polynucleotides and 
polypeptides. Such uses include the treatment of sleep disorders, jet lag, pathologies that occur 
in advanced age, among others. In a further aspect the invention relates to HSCLOCK 
recombinant materials and methods for their production. Another aspect of the invention relates 
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to methods for using such recombinant HSCLOCK polypeptides and polynucleotides. In still 
another aspect, the invention relates to methods to identify agonists and antagonists using the 
materials provided by the invention, and treating conditions associated with HSCLOCK 
imbalance with the identified compounds. Yet another aspect of the invention relates to 
diagnostic assays for detecting diseases associated with inappropriate HSCLOCK activity or 
levels. 

DESCRIPTION OF THE INVENTION 

Definitions 

The following definitions are provided to facilitate understanding of certain terms used 
frequently herein. 

"ucpt nr^v" m f*± r c* omnnft ^fVi^c rr~r*«ra1K/ to a nn1\/nentiHp havina the amino acid 
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sequence set forth in SEQ ID NO:2 or an allelic variant thereof. 

"HSCLOCK activity or HSCLOCK polypeptide activity" or "biological activity of the 
HSCLOCK or HSCLOCK polypeptide" refers to the metabolic or physiologic function of said 
HSCLOCK including similar activities or improved activities or these activities with decreased 
undesirable side-effects. Also included are antigenic and immunogenic activities of said 
HSCLOCK. 

"HSCLOCK gene" refers to a polynucleotide having the nucleotide sequence set forth 
in SEQ ID NO:l or allelic variants thereof and/or their complements. 

"Antibodies" as used herein includes polyclonal and monoclonal antibodies, chimeric, 
single chain, and humanized antibodies, as well as Fab fragments, including the products of an 
Fab or other immunoglobulin expression library. 

"Isolated" means altered "by the hand of man" from the natural state. If an "isolated" 
composition or substance occurs in nature, it has been changed or removed from its original 
environment, or both. For example, a polynucleotide or a polypeptide naturally present in a 
living animal is not "isolated," but the same polynucleotide or polypeptide separated from the 
coexisting materials of its natural state is "isolated", as the term is employed herein. 

"Polynucleotide" generally refers to any polyribonucleotide or 
polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. 
"Polynucleotides" include, without limitation single- and double-stranded DNA, DNA that is a 
mixture of single- and double-stranded regions, single- and double-stranded RNA, and RNA 
that is mixture of single- and double-stranded regions, hybrid molecules comprising DNA and 
RNA that may be single-stranded or, more typically, double-stranded or a mixture of single- 
and double-stranded regions. In addition, "polynucleotide" refers to triple-stranded regions 
comprising RNA or DNA or both RNA and DNA. The term polynucleotide also includes 
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DNAs or RNAs containing one or more modified bases and DNAs or RNAs with backbones 
modified for stability or for other reasons. "Modified" bases include, for example, tritylated 
bases and unusual bases such as inosine. A variety of modifications has been made to DNA 
and RNA; thus, "polynucleotide" embraces chemically, enzymatically or metabolically 
modified forms of polynucleotides as typically found in nature, as well as the chemical forms 
of DNA and RNA characteristic of viruses and cells. "Polynucleotide" also embraces 
relatively short polynucleotides, often referred to as oligonucleotides. 

"Polypeptide" refers to any peptide or protein comprising two or more amino acids 
joined to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. 
"Polypeptide" refers to both short chains, commonly referred to as peptides, oligopeptides or 
oligomers, and to longer chains, generally referred to as proteins. Polypeptides may contain 
amino acids other than the 20 gene-encoded amino acids. "Polypeptides" include amino acid 
sequences modified either by natural processes, such as posttranslational processing, or by 
chemical modification techniques which are well known in the art. Such modifications are 
well described in basic texts and in more detailed monographs, as well as in a voluminous 
research literature. Modifications can occur anywhere in a polypeptide, including the peptide 
backbone, the amino acid side-chains and the amino or carboxyl termini. It will be appreciated 
that the same type of modification may be present in the same or varying degrees at several 
sites in a given polypeptide. Also, a given polypeptide may contain many types of 
modifications. Polypeptides may be branched as a result of ubiquitination, and they may be 
cyclic, with or without branching. Cyclic, branched and branched cyclic polypeptides may 
result from posttranslation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent 
attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, 
demethylation, formation of covalent cross-links, formation of cystine, formation of 
pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor formation, 
hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic processing, 
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA mediated 
addition of amino acids to proteins such as arginylation, and ubiquitination. See, for instance, 
PROTEINS -STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, 
W. H. Freeman and Company, New York, 1993 and Wold, F., Posttranslational Protein 
Modifications: Perspectives and Prospects, pgs. 1-12 in POSTTRANSLATIONAL 
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic Press, New 
York, 1983; Seifter et aL, "Analysis for protein modifications and nonprotein cofactors", Meth 
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Enzymol (1990) 182:626-646 and Rattan et aL, "Protein Synthesis: Posttranslational 
Modifications and Aging", Ann NYAcadSci (1992) 663:48-62. 

"Variant" as the term is used herein, is a polynucleotide or polypeptide that differs 
from a reference polynucleotide or polypeptide respectively, but retains essential properties. A 
typical variant of a polynucleotide differs in nucleotide sequence from another, reference 
polynucleotide. Changes in the nucleotide sequence of the variant may or may not alter the 
amino acid sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide 
changes may result in amino acid substitutions, additions, deletions, fusions and truncations in 
the polypeptide encoded by the reference sequence, as discussed below. A typical variant of a 
polypeptide differs in amino acid sequence from another, reference polypeptide. Generally, 
differences are limited so that the sequences of the reference polypeptide and the variant are 
closely similar overall and, in many regions, identical. A variant and reference polypeptide 
may differ in amino acid sequence by one or more substitutions, additions, deletions in any 
combination. A substituted or inserted amino acid residue may or may not be one encoded by 
the genetic code. A variant of a polynucleotide or polypeptide may be a naturally occurring 
such as an allelic variant, or it may be a variant that is not known to occur naturally. Non- 
naturally occurring variants of polynucleotides and polypeptides may be made by mutagenesis 
techniques or by direct synthesis. 

"Percent identity'*, as known in the art, is a measure of the relationship between two 
polypeptide sequences or two polynucleotide sequences, as determined by comparing their 
sequences. In general, the two sequences to be compared are aligned to give a maximum 
correlation between the sequences. The alignment of the two sequences is examined and the 
number of positions giving an exact amino acid or nucleotide correspondence between the two 
sequences determined, divided by the total length of the alignment and multiplied by 100 to 
give a % identity figure. This % identity figure may be determined over the whole length of 
the sequences to be compared, which is particularly suitable for sequences of the same or very 
similar length and which are highly homologous, or over shorter defined lengths, which is 
more suitable for sequences of unequal length or which have a lower level of homology. 

"Percent similarity", as known in the art, is a further measure of the relationship 
between two polypeptide sequences. The two sequences being compared are aligned to give 
maximum correlation between the sequences. The alignment of the two sequences is 
examined at each position and a score is determined according to the chemical and/or physical 
properties of the amino acids being compared at that position. Such chemical and/or physical 
properties include charge, size and hydrophobicity of the amino acid side chains. 
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Methods for comparing the identity and similarity of two or more sequences are well 
known in the art. Thus for instance, programs available in the Wisconsin Sequence Analysis 
Package, version 9.1 (Devereux J et al, Nucleic Acids Res, 12, 387-395, 1984, available from 
Genetics Computer Group, Madison, Wisconsin, USA), for example the programs BESTFIT 
and GAP, may be used to determine the % identity between two polynucleotides and the % 
identity and the % similarity between two polypeptide sequences. BESTFIT uses the "local 
homology" algorithm of Smith and Waterman (Advances in Applied Mathematics, 2, 482-489, 
1981) and finds the best single region of similarity between two sequences. BESTFIT is more 
suited to comparing two polynucleotide or two polypeptide sequences which are dissimilar in 
length, the program assuming that the shorter sequence represents a portion of the longer. In 
comparison, GAP aligns two sequences, finding a "maximum similarity", according to the 

olrr^'tk™ AfNA^HIPtnQn f>rxA Wnnc^Vi fl Mol T*in1 dd'l-dSI 1 Q7fH GAP is more suited to 

comparing sequences which are approximately the same length and an alignment is expected 
over the entire length. Preferably, the parameters "Gap Weight" and "Length Weight" used in 
each program are 50 and 3, for polynucleotide sequences and 12 and 4 for polypeptide 
sequences, respectively. Preferably, % identities and similarities are determined when the two 
sequences being compared are optimally aligned. 

Other programs for determining identity and/or similarity between sequences are also 
known in the art, for instance the BLAST family of programs (Altschul S F et al, J Mol Biol, 
215, 403-410, 1990, Altschul S F et al, Nucleic Acids Res., 25:389-3402, 1997, available from 
the National Center for Biotechnology Information (NCBI), Bethesda, Maryland, USA and 
accessible through the home page of the NCBI at www.ncbi.nlm.nih.gov) and FASTA 
(Pearson W R and Lipman D J, Proc Nat Acad Sci USA, 85, 2444-2448,1988, available as part 
of the Wisconsin Sequence Analysis Package). Preferably, the BLOSUM62 amino acid 
substitution matrix (Henikoff S and Henikoff J G, Proc. Nat. Acad Sci. USA, 89, 10915-10919, 
1992) is used in polypeptide sequence comparisons including where nucleotide sequences are 
first translated into amino acid sequences before comparison. 

Preferably, the program BESTFIT is used to determine the % identity of a query 
polynucleotide or a polypeptide sequence with respect to a polynucleotide or a polypeptide 
sequence of the present invention, the query and the reference sequence being optimally 
aligned and the parameters of the program set at the default value. 

Polypeptides of the Invention 

In one aspect, the present invention relates to HSCLOCK polypeptides (or HSCLOCK 
proteins). The HSCLOCK polypeptides include the polypeptide of SEQ ID NO:2; as well as 
polypeptides comprising the amino acid sequence of SEQ ID NO: 2; and polypeptides 
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comprising an amino acid sequence which has at least 80% identity to that of SEQ ID NO:2 
over its entire length, and still more preferably at least 90% identity, and even still more 
preferably at least 95% identity to SEQ ID NO: 2. Furthermore, those amino acid sequences 
with at least 97-99% are highly preferred. Also included within HSCLOCK polypeptides are 
polypeptides having an amino acid sequence which has at least 80% identity to the polypeptide 
sequence of SEQ ID NO:2 over its entire length, and still more preferably at least 90% 
identity, and still more preferably at least 95% identity to SEQ ID NO:2. Furthermore, those 
with at least 97-99% are highly preferred. Preferably HSCLOCK polypeptide exhibit at least 
one biological activity of HSCLOCK. 

The HSCLOCK polypeptides may be in the form of the "mature'* protein or may be a 
part of a larger protein such as a fusion protein. It is often advantageous to include an 
additional amino acid sequence which contains secretory or leader sequences pro-sequenceS; 
sequences which aid in purification such as multiple histidine residues, or an additional 
sequence for stability during recombinant production. 

Fragments of the HSCLOCK polypeptides are also included in the invention. A 
fragment is a polypeptide having an amino acid sequence that entirely is the same as part, but not 
all, of the amino acid sequence of the aforementioned HSCLOCK polypeptides. As with 
HSCLOCK polypeptides, fragments may be "free-standing," or comprised within a larger 
polypeptide of which they form a part or region, most preferably as a single continuous region. 
Representative examples of polypeptide fragments of the invention, include, for example, 
fragments from about amino acid number 1-20, 21-40, 41-60, 61-80, 81-100, and 101 to the end 
of HSCLOCK polypeptide. In this context "about" includes the particularly recited ranges larger 
or smaller by several, 5, 4, 3, 2 or 1 amino acid at either extreme or at both extremes. 

Preferred fragments include, for example, truncation polypeptides having the amino acid 
sequence of HSCLOCK polypeptides, except for deletion of a continuous series of residues that 
includes the amino terminus, or a continuous series of residues that includes the carboxyl 
terminus or deletion of two continuous series of residues, one including the amino terminus and 
one including the carboxyl terminus. Also preferred are fragments characterized by structural or 
functional attributes such as fragments that comprise alpha-helix and alpha-helix forming 
regions, beta-sheet and beta-sheet- forming regions, turn and turn-forming regions, coil and coil- 
forming regions, hydrophilic regions, hydrophobic regions, alpha amphipathic regions, beta 
amphipathic regions, flexible regions, surface-forming regions, substrate binding region, and high 
antigenic index regions. Other preferred fragments are biologically active fragments. 
Biologically active fragments are those that mediate HSCLOCK activity, including those with a 
similar activity or an improved activity, or with a decreased undesirable activity. Also included 
are those that are antigenic or immunogenic in an animal, especially in a human. 
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Preferably, all of these polypeptide fragments retain the biological activity of the 
HSCLOCK, including antigenic activity. Variants of the defined sequence and fragments also 
form part of the present invention. Preferred variants are those that vary from the referents by 
conservative amino acid substitutions — i.e., those that substitute a residue with another of like 
characteristics. Typical such substitutions are among Ala, Val, Leu and He; among Ser and Thr; 
among the acidic residues Asp and Glu; among Asn and Gin; and among the basic residues Lys 
and Arg; or aromatic residues Phe and Tyr. Particularly preferred are variants in which several, 
5-10, 1-5, or 1-2 amino acids are substituted, deleted, or added in any combination. 

The HSCLOCK polypeptides of the invention can be prepared in any suitable manner. 
Such polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a combination of 

lVi/-»co *v»#»fhr*r1e Ayfportc fVvr rw=»r»arinfr cnr»Vi nnhmfntiflpc nrp WP.11 linHp.rstnnH in the art. 
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Polynucleotides of the Invention 

Another aspect of the invention relates to HSCLOCK polynucleotides. HSCLOCK 
polynucleotides include isolated polynucleotides which encode the HSCLOCK polypeptides and 
fragments, and polynucleotides closely related thereto. More specifically, HSCLOCK 
polynucleotides of the invention include a polynucleotide comprising the nucleotide sequence 
contained in SEQ ID NO: 1 encoding a HSCLOCK polypeptide of SEQ ID NO: 2, and a 
polynucleotide having the particular sequence of SEQ ID NO: 1 . HSCLOCK polynucleotides 
further include a polynucleotide comprising a nucleotide sequence that has at least 80% identity 
over its entire length to a nucleotide sequence encoding the HSCLOCK polypeptide of SEQ ID 
NO:2, and a polynucleotide comprising a nucleotide sequence that is at least 80% identical to of 
SEQ ID NO: 1 over its entire length. In this regard, polynucleotides at least 90% identical are 
particularly preferred, and those with at least 95% are especially preferred. Furthermore, those 
with at least 97% identity are highly preferred and those with at least 98-99% are most highly 
preferred, polynucleotides with at least 99% being the most preferred. Also included under 
HSCLOCK polynucleotides is a nucleotide sequence which has sufficient identity to a 
nucleotide sequence contained in SEQ ID NO:l to hybridize under conditions useable for 
amplification or for use as a probe or marker. The invention also provides polynucleotides 
which are complementary to such HSCLOCK polynucleotides. 

HSCLOCK of the invention is structurally related to other proteins of the clock gene 
family. The cDNA sequence of SEQ ED NO:l contains an open reading frame (nucleotide 
number 252 to 2789) encoding a polypeptide of 846 amino acids of SEQ ID NO:2. Amino acid 
sequence of SEQ ID NO:2 has about 96% identity (using Smith- Waterman) in 846 amino acid 
residues with mouse clock protein (D. P. King et al, Cell 89: 641-653, 1997). The nucleotide 
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sequence of SEQ ID NO: 1 has about 87% identity (using Smith- Waterman) in 4663 nucleotide 
residues with the cDNA encoding the mouse clock protein (D. P. King et al, Cell 89: 641-653, 
1997). 

An HSCLOCK polynucleotide may be obtained using standard cloning and screening, 
from a cDNA library derived from mRNA in cells of human brain using the expressed sequence 
tag (EST) analysis (Adams, M.D., et al Science (1991) 252:1651-1656; Adams, M.D. et al, 
Nature, (1992) 555:632-634; Adams, M.D., et al, Nature (1995) 377 Supp:3-174). 
Polynucleotides of the invention can also be obtained from natural sources such as genomic 
DNA libraries or can be synthesized using well known and commercially available techniques. 

The nucleotide sequence encoding HSCLOCK polypeptide of SEQ ID NO: 2 may be 
identical to the polypeptide encoding sequence contained in SEQ ID NO: 1 (nucleotide number 
to 3789Y or it mav he. a sennence. which as a result of the redundancv (deeeneracvl of the 
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genetic code, also encodes the polypeptide of SEQ ED NO:2. 

When the polynucleotides of the invention are used for the recombinant production of 
HSCLOCK polypeptide, the polynucleotide may include the coding sequence for the mature 
polypeptide or a fragment thereof, by itself; the coding sequence for the mature polypeptide or 
fragment in reading frame with other coding sequences, such as those encoding a leader or 
secretory sequence, a pre-, or pro- or prepro- protein sequence, or other fusion peptide portions. 
For example, a marker sequence which facilitates purification of the fused polypeptide can be 
encoded. In certain preferred embodiments of this aspect of the invention, the marker sequence is 
a hexa-histidine peptide, as provided in the pQE vector (Qiagen, Inc.) and described in Gentz et 
al, Proc Natl Acad Sci USA (1989) 86:821-824, or is an HA tag. The polynucleotide may also 
contain non-coding 5' and 3' sequences, such as transcribed, non-translated sequences, splicing 
and polyadenylation signals, ribosome binding sites and sequences that stabilize mRNA. 

Further preferred embodiments are polynucleotides encoding HSCLOCK variants 
comprise the amino acid sequence HSCLOCK polypeptide of SEQ ID NO:2 in which several, 
5-10, 1-5, 1-3, 1-2 or 1 amino acid residues are substituted, deleted or added, in any combination. 

The present invention further relates to polynucleotides that hybridize to the herein 
above-described sequences. In this regard, the present invention especially relates to 
polynucleotides which hybridize under stringent conditions to the herein above-described 
polynucleotides. As herein used, the term "stringent conditions" means hybridization will occur 
only if there is at least 80%, and preferably at least 90%, and more preferably at least 95%, yet 
even more preferably 97-99% identity between the sequences. 

Polynucleotides of the invention, which are identical or sufficiently identical to a 
nucleotide sequence contained in SEQ ID NO: 1 or a fragment thereof, may be used as 
hybridization probes for cDNA and genomic DNA, to isolate full-length cDNAs and genomic 
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clones encoding HSCLOCK polypeptide and to isolate cDNA and genomic clones of other genes 
(including genes encoding homologs and orthologs from species other than human) that have a 
high sequence similarity to the HSCLOCK gene. Such hybridization techniques are known to 
those of skill in the art. Typically these nucleotide sequences are 80% identical, preferably 90% 
identical, more preferably 95% identical to that of the referent. The probes generally will 
comprise at least 15 nucleotides. Preferably, such probes will have at least 30 nucleotides and 
may have at least 50 nucleotides. Particularly preferred probes will range between 30 and 50 
nucleotides. 

In one embodiment, to obtain a polynucleotide encoding HSCLOCK polypeptide, 
including homologs and orthologs from species other than human, comprises the steps of 
screening an appropriate library under stingent hybridization conditions with a labeled probe 
having the SEQ ID NO: 1 or a fragment thereof; and isolating frill-length cDNA and genomic 
clones containing said polynucleotide sequence. Thus in another aspect, HSCLOCK 
polynucleotides of the present invention further include a nucleotide sequence comprising a 
nucleotide sequence that hybridize under stringent condition to a nucleotide sequence having 
SEQ ID NO: 1 or a fragment thereof. Also included with HSCLOCK polypeptides are 
polypeptide comprising amino acid sequence encoded by nucleotide sequence obtained by the 
above hybridization condition. Such hybridization techniques are well known to those of skill in 
the art. Stringent hybridization conditions are as defined above or, alternatively, conditions under 
overnight incubation at 42°C in a solution comprising: 50% formamide, 5xSSC (150mM NaCl, 
15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's solution, 10 % 
dextran sulfate, and 20 microgram/ml denatured, sheared salmon sperm DNA, followed by 
washing the filters in 0.1 x SSC at about 65°C. 

The polynucleotides and polypeptides of the present invention may be employed as 
research reagents and materials for discovery of treatments and diagnostics to animal and human 
disease. 

Vectors, Host Cells, Expression 

The present invention also relates to vectors which comprise a polynucleotide or 
polynucleotides of the present invention, and host cells which are genetically engineered with 
vectors of the invention and to the production of polypeptides of the invention by recombinant 
techniques. Cell-free translation systems can also be employed to produce such proteins using 
RNAs derived from the DNA constructs of the present invention. 

For recombinant production, host cells can be genetically engineered to incorporate 
expression systems or portions thereof for polynucleotides of the present invention. Introduction 
of polynucleotides into host cells can be effected by methods described in many standard 
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laboratory manuals, such as Davis et al. , BASIC METHODS IN MOLECULAR BIOLOGY (1986) 
and Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed., Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989) such as calcium phosphate 
transfection, DEAE-dextran mediated transfection, transvection, microinjection, cationic lipid- 
mediated transfection, electroporation, transduction, scrape loading, ballistic introduction or 
infection. 

Representative examples of appropriate hosts include bacterial cells, such as 
streptococci, staphylococci, E. coli, Streptomyces and Bacillus subtilis cells; fungal cells, such as 
yeast cells and Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; 
animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, HEK 293 and Bowes melanoma cells; 
and plant cells. 

A great variety of expression systems can be used. Such systems include^ among others, 
chromosomal, episomal and virus-derived systems, e.g., vectors derived from bacterial plasmids, 
from bacteriophage, from transposons, from yeast episomes, from insertion elements, from yeast 
chromosomal elements, from viruses such as baculoviruses, papova viruses, such as SV40, 
vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and 
vectors derived from combinations thereof, such as those derived from plasmid and 
bacteriophage genetic elements, such as cosmids and phagemids. The expression systems may 
contain control regions that regulate as well as engender expression. Generally, any system or 
vector suitable to maintain, propagate or express polynucleotides to produce a polypeptide in a 
host may be used. The appropriate nucleotide sequence may be inserted into an expression 
system by any of a variety of well-known and routine techniques, such as, for example, those set 
forth in Sambrook et a/., MOLECULAR CLONING, A LABORATORY MANUAL {supra). 

For secretion of the translated protein into the lumen of the endoplasmic reticulum, into 
the periplasmic space or into the extracellular environment, appropriate secretion signals may be 
incorporated into the desired polypeptide. These signals may be endogenous to the polypeptide 
or they may be heterologous signals. 

If the HSCLOCK polypeptide is to be expressed for use in screening assays, generally, it 
is preferred that the polypeptide be produced at the surface of the cell. In this event, the cells 
may be harvested prior to use in the screening assay. If HSCLOCK polypeptide is secreted 
into the medium, the medium can be recovered in order to recover and purify the polypeptide; 
if produced intracellularly, the cells must first be lysed before the polypeptide is recovered. 

HSCLOCK polypeptides can be recovered and purified from recombinant cell cultures 
by well-known methods including ammonium sulfate or ethanol precipitation, acid extraction, 
anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 

-10- 
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chromatography. Most preferably, high performance liquid chromatography is employed for 
purification. Well known techniques for refolding proteins may be employed to regenerate active 
conformation when the polypeptide is denatured during isolation and or purification. 

Diagnostic Assays 

This invention also relates to the use of HSCLOCK polynucleotides for use as diagnostic 
reagents. Detection of a mutated form of HSCLOCK gene associated with a dysfunction will 
provide a diagnostic tool that can add to or define a diagnosis of a disease or susceptibility to a 
disease which results from under-expression, over-expression or altered expression of 
HSCLOCK. Individuals carrying mutations in the HSCLOCK gene may be detected at the DNA 
level by a variety of techniques. 

Nucleic acids for diagnosis mav be obtained from a subiect's cells, such as from blood, 
urine, saliva, tissue biopsy or autopsy material. The genomic DNA may be used directly for 
detection or may be amplified enzymatically by using PCR or other amplification techniques 
prior to analysis. RNA or cDNA may also be used in similar fashion. Deletions and insertions 
can be detected by a change in size of the amplified product in comparison to the normal 
genotype. Point mutations can be identified by hybridizing amplified DNA to labeled 
HSCLOCK nucleotide sequences. Perfectly matched sequences can be distinguished from 
mismatched duplexes by RNase digestion or by differences in melting temperatures. DNA 
sequence differences may also be detected by alterations in electrophoretic mobility of DNA 
fragments in gels, with or without denaturing agents, or by direct DNA sequencing. See, e.g., 
Myers et al., Science (1985) 230: 1242. Sequence changes at specific locations may also be 
revealed by nuclease protection assays, such as RNase and SI protection or the chemical 
cleavage method. See Cotton etaL, Proc Natl Acad Sci USA (1985) 85: 4397-4401. In another 
embodiment, an array of oligonucleotides probes comprising HSCLOCK nucleotide sequence or 
fragments thereof can be constructed to conduct efficient screening of e.g., genetic mutations. 
Array technology methods are well known and have general applicability and can be used to 
address a variety of questions in molecular genetics including gene expression, genetic linkage, 
and genetic variability. (See for example: M.Chee et al., Science, Vol 274, pp 610-613 (1996)). 

The diagnostic assays offer a process for diagnosing or determining a susceptibility to 
sleep disorders, jet lag, pathologies that occur in advanced age through detection of mutation in 
the HSCLOCK gene by the methods described. 

In addition, sleep disorders, jet lag, pathologies that occur in advanced age, can be 
diagnosed by methods comprising determining from a sample derived from a subject an 
abnormally decreased or increased level of HSCLOCK polypeptide or HSCLOCK mRNA. 
Decreased or increased expression can be measured at the RNA level using any of the methods 
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well known in the art for the quantitation of polynucleotides, such as, for example, PCR, RT- 
PCR, RNase protection, Northern blotting and other hybridization methods. Assay techniques 
that can be used to determine levels of a protein, such as an HSCLOCK polypeptide, in a sample 
derived from a host are well-known to those of skill in the art. Such assay methods include 
radioimmunoassays, competitive-binding assays, Western Blot analysis and ELISA assays. 

Thus in another aspect, the present invention relates to a diagonostic kit for a disease 
or suspectability to a disease, particularly sleep disorders, jet lag, pathologies that occur in 
advanced age, which comprises: 

(a) a HSCLOCK polynucleotide, preferably the nucleotide sequence of SEQ ID NO: 1, or a 
fragment thereof; 

(b) a nucleotide sequence complementary to that of (a); 

fr\ * TTSPT nCV nnlvnentiHp nreferahlv the nolvnentiHe of SFO TD NO: 2. or a fragment 

V/ — — f — j f - t ? i j i j r - 1 _ _ ~ * - -v. ' *-/ 

thereof; or 

(d) an antibody to a HSCLOCK polypeptide, preferably to the polypeptide of SEQ ID NO: 2. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 
component. 

Antibodies 

The polypeptides of the invention or their fragments or analogs thereof, or cells 
expressing them can also be used as immunogens to produce antibodies immunospecific for the 
HSCLOCK polypeptides. The term "immunospecific" means that the antibodies have 
substantially greater affinity for the polypeptides of the invention than their affinity for other 
related polypeptides in the prior art. 

Antibodies generated against the HSCLOCK polypeptides can be obtained by 
administering the polypeptides or epitope-bearing fragments, analogs or cells to an animal, 
preferably a nonhuman, using routine protocols. For preparation of monoclonal antibodies, any 
technique which provides antibodies produced by continuous cell line cultures can be used. 
Examples include the hybridoma technique (Kohler, G. and Milstein, C, Nature (1975) 256:495- 
497), the trioma technique, the human B-cell hybridoma technique (Kozbor et aL, Immunology 
Today (1983) 4:72) and the EBV-hybridoma technique (Cole et aL, MONOCLONAL 
ANTIBODIES AND CANCER THERAPY, pp. 77-96, Alan R. Liss, Inc., 1985). 

Techniques for the production of single chain antibodies (U.S. Patent No. 4,946,778) 
can also be adapted to produce single chain antibodies to polypeptides of this invention. Also, 
transgenic mice, or other organisms including other mammals, may be used to express 
humanized antibodies. 
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The above-described antibodies may be employed to isolate or to identify clones 
expressing the polypeptide or to purify the polypeptides by affinity chromatography. 

Antibodies against HSCLOCK polypeptides may also be employed to treat sleep 
disorders, jet lag, pathologies that occur in advanced age, among others. 

Vaccines 

Another aspect of the invention relates to a method for inducing an immunological 
response in a mammal which comprises inoculating the mammal with HSCLOCK polypeptide, 
or a fragment thereof, adequate to produce antibody and/or T cell immune response to protect 
said animal from sleep disorders, jet lag, pathologies that occur in advanced age, among others. 
Yet another aspect of the invention relates to a method of inducing immunological response in 
a mammal which comprises^ delivering HSCLOCK polypeptide via a vector directing 
expression of HSCLOCK polynucleotide in vivo in order to induce such an immunological 
response to produce antibody to protect said animal from diseases. 

Further aspect of the invention relates to an immunological/vaccine formulation 
(composition) which, when introduced into a mammalian host, induces an immunological 
response in that mammal to a HSCLOCK polypeptide wherein the composition comprises a 
HSCLOCK polypeptide or HSCLOCK gene. The vaccine formulation may further comprise a 
suitable carrier. Since HSCLOCK polypeptide may be broken down in the stomach, it is 
preferably administered parenterally (including subcutaneous, intramuscular, intravenous, 
intradermal etc. injection). Formulations suitable for parenteral administration include 
aqueous and non-aqueous sterile injection solutions which may contain anti-oxidants, buffers, 
bacteriostats and solutes which render the formulation instonic with the blood of the recipient; 
and aqueous and non-aqueous sterile suspensions which may include suspending agents or 
thickening agents. The formulations may be presented in unit-dose or multi-dose containers, 
for example, sealed ampoules and vials and may be stored in a freeze-dried condition requiring 
only the addition of the sterile liquid carrier immediately prior to use. The vaccine 
formulation may also include adjuvant systems for enhancing the immunogenicity of the 
formulation, such as oil-in water systems and other systems known in the art. The dosage will 
depend on the specific activity of the vaccine and can be readily determined by routine 
experimentation. 

Screening Assays 

The HSCLOCK polypeptide of the present invention may be employed in a screening 
process for compounds which activate (agonists) or inhibit activation of (antagonists, or 
otherwise called inhibitors) the HSCLOCK polypeptide of the present invention. Thus, 
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polypeptides of the invention may also be used to assess identify agonist or antagonists from, for 
example, cells, cell-free preparations, chemical libraries, and natural product mixtures. These 
agonists or antagonists may be natural or modified substrates, ligands, receptors, enzymes, etc., 
as the case may be, of the polypeptide of the present invention; or may be structural or functional 
mimetics of the polypeptide of the present invention. See Coligan et al. 9 Current Protocols in 
Immunology l(2):Chapter 5 (1991). 

HSCLOCK polypeptides are responsible for many biological functions, including many 
pathologies. Accordingly, it is desirous to find compounds and drugs which stimulate 
HSCLOCK polypeptide on the one hand and which can inhibit the function of HSCLOCK 
polypeptide on the other hand. In general, agonists are employed for therapeutic and 
prophylactic purposes for such conditions as sleep disorders, jet lag, pathologies that occur in 
advanced age. Antagonists may be employed for a variety of therapeutic and prophylactic 
purposes for such conditions as sleep disorders, jet lag, pathologies that occur in advanced age. 

In general, such screening procedures may involve using appropriate cells which express 
the HSCLOCK polypeptide or respond to HSCLOCK polypeptide of the present invention. Such 
cells include cells from mammals, yeast, Drosophila or E. coli. Cells which express the 
HSCLOCK polypeptide (or cell membrane containing the expressed polypeptide) or respond to 
HSCLOCK polypeptide are then contacted with a test compound to observe binding, or 
stimulation or inhibition of a functional response. The ability of the cells which were contacted 
with the candidate compounds is compared with the same cells which were not contacted for 
HSCLOCK activity. 

The assays may simply test binding of a candidate compound wherein adherence to the 
cells bearing the HSCLOCK polypeptide is detected by means of a label directly or indirectly 
associated with the candidate compound or in an assay involving competition with a labeled 
competitor. Further, these assays may test whether the candidate compound results in a signal 
generated by activation of the HSCLOCK polypeptide, using detection systems appropriate to 
the cells bearing the HSCLOCK polypeptide. Inhibitors of activation are generally assayed in 
the presence of a known agonist and the effect on activation by the agonist by the presence of 
the candidate compound is observed. 

Further, the assays may simply comprise the steps of mixing a candidate compound 
with a solution containing a HSCLOCK polypeptide to form a mixture, measuring HSCLOCK 
activity in the mixture, and comparing the HSCLOCK activity of the mixture to a standard. 

The HSCLOCK cDNA, protein and antibodies to the protein may also be used to 
configure assays for detecting the effect of added compounds on the production of HSCLOCK 
mRNA and protein in cells. For example, an ELISA may be constructed for measuring 
secreted or cell associated levels of HSCLOCK protein using monoclonal and polyclonal 
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antibodies by standard methods known in the art, and this can be used to discover agents which 
may inhibit or enhance the production of HSCLOCK (also called antagonist or agonist, 
respectively) from suitably manipulated cells or tissues. 

The HSCLOCK protein may be used to identify membrane bound or soluble receptors, 

5 if any, through standard receptor binding techniques known in the art. These include, but are 
not limited to, ligand binding and crosslinking assays in which the HSCLOCK is labeled with a 
radioactive isotope (eg 1251), chemically modified (eg biotinylated), or fused to a peptide 
sequence suitable for detection or purification, and incubated with a source of the putative 
receptor (cells, cell membranes, cell supernatants, tissue extracts, bodily fluids). Other 

10 methods include biophysical techniques such as surface plasmon resonance and spectroscopy. 
In addition to being used for purification and cloning of the receptor, these binding assays can 

be used to identify agonists and antagonists of HSCLOCK which comnete with the bindine of 

■ j <_> <~> » »-' 

HSCLOCK to its receptors, if any. Standard methods for conducting screening assays are well 



£3 understood in the art. 

P 

r=j 15 Examples of potential HSCLOCK polypeptide antagonists include antibodies or, in some 
cases, oligonucleotides or proteins which are closely related to the ligands, substrates, receptors, 

01 

y3 enzymes, etc., as the case may be, of the HSCLOCK polypeptide, e.g., a fragment of the ligands, 

r * substrates, receptors, enzymes, etc.; or small molecules which bind to the polypetide of the 

a 

present invention but do not elicit a response, so that the activity of the polypeptide is prevented. 

Host 

p 20 Thus in another aspect, the present invention relates to a screening kit for identifying 

03 agonists, antagonists, ligands, receptors, substrates, enzymes, etc. for HSCLOCK polypeptides; 

T7 or compounds which decrease or enhance the production of HSCLOCK polypeptides, which 

comprises: 

(a) a HSCLOCK polypeptide, preferably that of SEQ ID NO:2; 
25 (b) a recombinant cell expressing a HSCLOCK polypeptide, preferably that of SEQ ID NO:2; 



(c) a cell membrane expressing a HSCLOCK polypeptide; preferably that of SEQ ID NO: 2; or 

(d) antibody to a HSCLOCK polypeptide, preferably that of SEQ ID NO: 2. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 
component. 

30 

Prophylactic and Therapeutic Methods 

This invention provides methods of treating abnormal conditions such as, sleep 
disorders, jet lag, pathologies that occur in advanced age, related to both an excess of and 
insufficient amounts of HSCLOCK polypeptide activity. 
35 If the activity of HSCLOCK polypeptide is in excess, several approaches are available. 

One approach comprises administering to a subject an inhibitor compound (antagonist) as 
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hereinabove described along with a pharmaceutical^ acceptable carrier in an amount effective to 
inhibit the function of the HSCLOCK polypeptide, such as, for example, by blocking the binding 
of ligands, substrates, receptors, enzymes, etc., or by inhibiting a second signal, and thereby 
alleviating the abnormal condition. In another approach, soluble forms of HSCLOCK 
polypeptides still capable of binding the ligand, substrate, enzymes, receptors, etc. in 
competition with endogenous HSCLOCK polypeptide may be administered. Typical 
embodiments of such competitors comprise fragments of the HSCLOCK polypeptide. 

In still another approach, expression of the gene encoding endogenous HSCLOCK 
polypeptide can be inhibited using expression blocking techniques. Known such techniques 
involve the use of antisense sequences, either internally generated or separately administered. 
See, for example, O'Connor, JNeurochem (1991) 56:560 in Oligodeoxynucleotides as 

n?r PrpQs TWa Raton FT. fl98RY Alternativelv. 

oligonucleotides which form triple helices with the gene can be supplied. See, for example, 
Lee et ai, Nucleic Acids Res (1979) 6:3073; Cooney et al, Science (1988) 241:456; Dervan et 
al., Science (1991) 251:1360. These oligomers can be administered perse or the relevant 
oligomers can be expressed in vivo. 

For treating abnormal conditions related to an under-expression of HSCLOCK and its 
activity, several approaches are also available. One approach comprises administering to a 
subject a therapeutically effective amount of a compound which activates HSCLOCK 
polypeptide, i.e., an agonist as described above, in combination with a pharmaceutically 
acceptable carrier, to thereby alleviate the abnormal condition. Alternatively, gene therapy may 
be employed to effect the endogenous production of HSCLOCK by the relevant cells in the 
subject. For example, a polynucleotide of the invention may be engineered for expression in a 
replication defective retroviral vector, as discussed above. The retroviral expression construct 
may then be isolated and introduced into a packaging cell transduced with a retroviral plasmid 
vector containing RNA encoding a polypeptide of the present invention such that the packaging 
cell now produces infectious viral particles containing the gene of interest. These producer cells 
may be administered to a subject for engineering cells in vivo and expression of the polypeptide 
in vivo. For overview of gene therapy, see Chapter 20, Gene Therapy and other Molecular 
Genetic-based Therapeutic Approaches, (and references cited therein) in Human Molecular 
Genetics, T Strachan and A P Read, BIOS Scientific Publishers Ltd (1996). Another approach is 
to administer a therapeutic amount of HSCLOCK polypeptides in combination with a suitable 
pharmaceutical carrier. 
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Formulation and Administration 

Peptides, such as the soluble form of HSCLOCK polypeptides, and agonists and 
antagonist peptides or small molecules, may be formulated in combination with a suitable 
pharmaceutical carrier. Such formulations comprise a therapeutically effective amount of the 
polypeptide or compound, and a pharmaceutically acceptable carrier or excipient. Such carriers 
include but are not limited to, saline, buffered saline, dextrose, water, glycerol, ethanol, and 
combinations thereof. Formulation should suit the mode of administration, and is well within the 
skill of the art. The invention further relates to pharmaceutical packs and kits comprising one or 
more containers filled with one or more of the ingredients of the aforementioned compositions of 
the invention. 

Polypeptides and other compounds of the present invention may be employed alone or in 

fhp»raTv»ntir rnmr\r\} inn Q 

U.11V^ 11VJ1I Willi UU1V1 VV/lllJ/UUiluU, ljuwaa. viivim|/v>.w« w < — f 

Preferred forms of systemic administration of the pharmaceutical compositions include 
injection, typically by intravenous injection. Other injection routes, such as subcutaneous, 
intramuscular, or intraperitoneal, can be used. Alternative means for systemic administration 
include transmucosal and transdermal administration using penetrants such as bile salts or fusidic 
acids or other detergents. In addition, if properly formulated in enteric or encapsulated 
formulations, oral administration may also be possible. Administration of these compounds may 
also be topical and/or localized, in the form of salves, pastes, gels and the like. 

The dosage range required depends on the choice of peptide, the route of administration, 
the nature of the formulation, the nature of the subject's condition, and the judgment of the 
attending practitioner. Suitable dosages, however, are in the range of 0.1-100 ng/kg of subject. 
Wide variations in the needed dosage, however, are to be expected in view of the variety of 
compounds available and the differing efficiencies of various routes of administration. For 
example, oral administration would be expected to require higher dosages than administration by 
intravenous injection. Variations in these dosage levels can be adjusted using standard empirical 
routines for optimization, as is well understood in the art. 

Polypeptides used in treatment can also be generated endogenously in the subject, in 
treatment modalities often referred to as "gene therapy" as described above. Thus, for example, 
cells from a subject may be engineered with a polynucleotide, such as a DNA or RNA, to encode 
a polypeptide ex vivo, and for example, by the use of a retroviral plasmid vector. The cells are 
then introduced into the subject. 

All publications, including but not limited to patents and patent applications, cited in 
this specification are herein incorporated by reference as if each individual publication were 
specifically and individually indicated to be incorporated by reference herein as though fully 
set forth. 
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SEQIDNO:l 

1 AGCTGATTCTATCACATTGTAAGATGCCTTTGGATAATTCTACAGTCCTCTTAAATGAAT 

61 CTTTAGAACTTGGCAAGTCTCACTAGATACCTTCAATCATCATTTTGAGCTCAAAGAATT 

121 CTGAGACTTATGGTTGGTCATATAGAAGAGGACCTTGAACCTATAGTTTCCTGAAGAATC 

181 AGTTTAAAAGATCCAAGGAGTACAAAAGGAGAAGTACAAATGTCTACTACAAGACGAAAA 

241 CGTAGTATGTTATGTTGTTTACCGTAAGCTGTAGTAAAATGAGCTCGATTGTTGACAGAG 

3 01 ATGACAGTAGTATTTTTGATGGGTTGGTGGAAGAAGATGACAAGGACAAAGCGAAAAGAG 

3 61 TATCTAGAAACAAATCTGAAAAGAAACGTAGAGATCAATTTAATGTTCTCATTAAAGAAC 
421 TGGGATCCATGCTTCCTGGTAATGCTAGAAAGATGGACAAATCTACTGTTCTGCAGAAAA 

4 81 GCATTGATTTTTTACGAAAACATAAAGAAATCACTGCACAGTCAGATGCTAGTGAAATTC 
541 GACAGGACTGGAAACCTACATTCCTTAGTAATGAAGAGTTTACACAATTAATGTTAGAGG 
601 CTCTTGATGGTTTTTTTTTAGCAATCATGACAGATGGAAGCATAATATATGTGTCTGAGA 
661 GTGTAACTTCATTACTTGAACATTTACCATCTGATCTTGTGGATCAAAGTATATTTAATT 
721 TTATCCCAGAAGGGGAACATTCAGAGGTTTATAAAATACTCTCTACTCATCTGCTGGAAA 
781 GTGATTCATTAACCCCAGAATATTTAAAATCAAAAAATCAGTTAGAATTCTGTTGTCACA 
841 TGCTGCGAGGAACAATAGAC C CAAAGGAGCC ATCTAC CTATGAATATGTAAAATTTATAG 
901 GAAATTTCAAATCTTTAAACAGTGTATCCTCTTCAGCACACAATGGTTTTGAAGGAACTA 
961 TACAACGCACACATAGGCCATCTTATGAAGATAGAGTTTGTTTTGTAGCTACTGTCAGGT 

1021 TAGCTACACCTCAGTTCATCAAGGAAATGTGCACTGTTGAAGAACCCAATGAAGAGTTTA 

1081 CATCTAGACATAGTTTAGAATGGAAGTTTCTGTTTCTAGATCACAGGGCACCACCCATAA 

1141 TAGGGTATTTGCCATTTGAAGTTCTGGGAACATC AGGCTATGATTACTATCATGTGGATG 

12 01 ACCTAGAAAATTTGGC AAAATGTCATGAGCACTTAATGC AATATGGGAAAGGCAAATCAT 

12 61 GTTATTATAGGTTCCTGACTAAGGGGCAACAGTGGATTTGGCTTCAGACTCATTATTATA 

13 21 TCACTTACCATCAGTGGAATTCAAGGCCAGAGTTTATTGTTTGTACTCACACTGTAGTAA 
13 81 GTTATGCAGAAGTTAGGGCTGAAAGACGACGAGAACTTGGCATTGAAGAGTCTCTTCCTG 
1441 AGACAGCTGCTGACAAAAGCCAAGATTCTGGGTC AGATAATCGTATAAAC ACAGTCAGTC 
1501 TCAAGGAAGCATTGGAAAGGTTTGATCACAGCCCAACCCCTTCTGCCTCTTCTCGGAGTT 
1561 CAAGAAAATCATCTCACACGGCCGTCTCAGACCCTTCCTCAACACCAACCAAGATCCCGA 
1621 CGGATACGAGCACTCCACCCAGGCAGCATTTACCAGCTCATGAGAAGATGGTGCAAAGAA 
1681 GGTCATCATTTAGTAGTCAGTCCATAAATTCCCAGTCTGTTGGTTCATCATTAACACAGC 
1741 CAGTGATGTCTCAAGCTACAAATTTACCAATTCCACAAGGCATGTCCCAGTTTCAGTTTT 
1801 CAGCTCAATTAGGAGCCATGCAACATCTGAAAGACCAATTGGAACAACGGACACGCATGA 
1861 TAGAAGCAAATATTCATCGGCAACAAGAAGAACTAAGAAAAATTCAAGAACAACTTCAGA 
1921 TGGTCCATGGTCAGGGGCTGCAGATGTTTTTGCAACAATCAAATCCTGGGTTGAATTTTG 
1981 GTTCCGTTCAACTTTCTTCTGGAAATTCATCTAACATCCAGCAACTTGCACCTATAAATA 
2 041 TGCAAGGCCAAGTTGTTCCTACTAACCAGATTCAAAGTGGAATGAATACTGGACACATTG 
2101 GCACAACTCAGCACATGATACAACAACAGACTTTACAGAGTACATCAACTCAGAGTCAAC 
2161 AAAATGTACTGAGTGGGCACAGTCAGCAAACATCTCTACCCAGTCAGACACAGAGCACTC 
2221 TTACAGCCCCACTGTATAACACTATGGTGATTTCTCAGCCTGCAGCCGGAAGCATGGTCC 
22 81 AGATTCCATCTAGTATGCCACAAAACAGCACCCAGAGTGCTGCAGTAACTACATTCACTC 
2341 AGGACAGGCAGATAAGATTTTCTCAAGGTCAACAACTTGTGACCAAATTAGTGACTGCTC 
24 01 CTGTAGCTTGTGGGGCAGTCATGGTACCTAGTACTATGCTTATGGGCCAGGTGGTGACTG 
24 61 CATATCCTACTTTTGCTACACAACAGCAACAGTCACAGACATTGTCAGTAACGCAGCAGC 
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2521 AGCAGCAGCAGAGCTCCCAGGAGCAGCAGCTCACTTCAGTTCAGCAACCATCTCAGGCTC 

2581 AGCTGACCCAGCCACCGCAACAATTTTTACAGACTTCTAGGTTGCTCCATGGGAATCCCT 

2641 CAACTCAACTCATTCTCTCTGCTGCATTTCCTCTACAACAGAGCACCTTCCCTCAGTCAC 

2701 ATCACCAGCAACATCAGTCTCAGCAACAGCAGCAACTCAGCCGGCACAGGACTGACAGCT 

27 61 TGCCCGACCCTTCCAAGGTTCAACCACAGTAGCACACGTGCTTCCTCTCTTGACATCAAG 

2 821 GGAGGAAGGGGATGGCCCATTAAGAGTTACTCAGATGACCTGAGGAAAGGAGGGAAAGTT 

2 8 81 CCAGCAGTTTCATGAGATGCAGTATTGAGTGTTCTAGTTCCTGGAATTAGTTGGCAGAGA 

2 941 AAATGCTGCCTAGTGCTACAGATGTACATTAAATACCAGCCAGCAGGAGGTGATCATAGG 

3 0 01 GGCACAGCCAGTTCTGACAGTGTTTTAGGTGCCTGGATATTTTTTGATGGAAAAAGAATA 
3 061 TATTGCCAAATATTAAGAAGCTCAGCTATGAAATGACCTCCAGGGAATCAGAAAGGCACT 
3121 AATGATGTTAGTAACTTTTAGTGGTTCTGTGCCTCTTATCAAGTGTTACAGAGGACATAC 
3181 CACTGCCATGTCAGGGGTTTGCTTACAGTGATGCCATGAAGACAGTCCAGTAGACTTGGT 
3241 AGCGACCCCCTCCCCCAACCCCTCTCCCTTTTCAGATAATGATGGAACAGTAATTACTTT 
3 3 01 CAGAATGTTGTGTGGGTTCAAATTCTCTATGTACAGATGATGTAAAAATATGTATATGTC 

_ _ _ -m -» -* -n -r. m f Timm nrmt m^rtrnnriTv rp/*i 7\ 7\ 7\ f* r^r^fT >r T l T\. r T*r* r T i (~ %r T >r T l C % C* T T % 

3421 TACAGGTGTGAGCACCTTTCCTGAAATTCTGACACCATGTGCAAACTGATCCATCCTGTT 

34 81 TTTCCTTTTGTTTACAACACAGTAGTGTTCTGTTCACTTTTCCGGGGCACAAGTTTTTTT 

3541 GTTCATACTTTGGCTGTGATGTC ACAGTTTGTTC AGTGAGGTATGATGTGCTGCTGGGAA 

3 6 01 TGGATTTTTTTTTTTCAGGTTAAATTATTGATACAACAGGATTTTCAAGTTATTCAGAAA 

3661 TATCCCTCATTTCATTATTTTTCAATTATGTTTGAAAATAGGATTTGCACTGCTTTATTT 

3721 TAGGTGGCTGGGAGTTTTGATTGCATATTTTGTTATAGTTCATAGTTGGAAATATTTGCG 

3781 TAAATGGTTTTCAACAAGCCTGAAAGTAATTTCAAGAATGTTTCAGTTATAGAGGTAAAA 

3 841 TTTGCACACAAAACATCTTAGGCACTTTTTAACATTCTCAATCATGGGAATTTTAACTTT 

3 901 TGGGATTTGTTGAAATCTTTTTTATTATCCTTCACAATTTCAATGCTTCTTTTAGTCAGA 

3 961 AATGATTCAGGGTTATTTGAGGGGAAAAAACCCCATAGTGCCTTGATTTTAATTCAGGTG 

4 021 ATAACTCACCATCTTGAAGTCATTGTCCGGTTTCCGTAGCAGTTTTGAAACCTTAGTACC 
4 081 TTTTTAACAGCATGTGGGTGTCAGTGTCATTATTAGTCTCCTAATAAGTTCCTCTGAAGA 
4141 CTGCTATCAGTCTCTTGGACTGGAGTTACAAATAATTTAGAAATAAAAGATGATAACCTA 

42 01 ACACTATCATAGTTATTAATGTGATCCTAAAATTGTTTCCTAAATCAGCATTTTTCTTTA 
4261 GTCATTTAAGAATTTAC C AGAAATATTTGCTCAATATGATCTTGATATTC CTAC AAAGAA 
4321 AAAAGAAGGGGTAGGGATTTGGCTATGCCTTCACTACAACATTAGAATATTGTAACTCAC 

43 81 ATGCCTTCTAAACGTGAACTAAGATTTCCTTTGGCAATATCATATTCTAAAAGTAATAAA 
4441 TTCCAATACAAGTTACATACATTTAAAAAACATTTTACAGATTTTATGGTACTAATGAAA 
4501 TTTACAGTGATAGAACAAAAGAGGATTAGTAGAAAATACATTATTAGAATATAAAAAATG 
4561 TTATTACTGAGGAAAGGGAGGAGAGGACAAGTGTAATAAATCAAAATTGACCTCAAAAGA 
4621 AAATGTGTAAC AGAGTTGAGGTTGTTAAAACAGAAAAGGTTCTGAATAATGAAGATTAAC 
4681 CTAATGCAGAATTGCTAGGTAAAGAGGTCAGGGGAATGCTAAGCCAGTTCTTAAGACTTC 
4741 TCTGTCCTCTGCTTTGCTGTTATCCTTAAGGCATATACTTTGTCTTTCTGCAGAAAATTC 
4801 TACCTGGCTACAATTACTTTGAACATTAATGTTGAAAAAGAAAACAACCAAAGAAAATTG 
4 861 GTACTTACCCTTCTACAAAAGAAGTGTGACTAGATATCAATCAGTAATTAACATATCAAG 
4921 GAGCTCTTCTAGCTAAATGACCATCCAGTAGAGATTTCCCACATTCCCATGAATATCAAG 
4 981 AATAGTTGTCAGAATATGTATGTACCTGAGCATATGTACACAGACAAGGGGGATGTTGTG 
5041 GAATATGGCAATAGCATTGTTCTTCTCCCCTTTCAAATTGCCTTTCTTGACCTTATGCCA 
5101 TTCCATATATATCTGAGTTGTGCCTCATTTATTTATTGGCAATACCTAGTGATACGGATT 
5161 TAGCTAACAAAAGATATGAAGAACTATTATATTGAGGCCTGTCCTCTAC ATACCACACTT 
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5221 AAAAGATGGTGAACTGTGAGTACTACTTAGGTTGACAGCAACAAAGCATAAGACAAGCCC 

5281 CAGGTAAACGTCTAAACTGTTTACTCACATTGTCCTACTCCAGCCCCTTCAATTATTTCC 

5341 CATCTCCACAAATAGTCGGGGGAAAAAATTAAAATTTTCCTTTATGATTCTTACTGTTCT 

54 01 TCGCAGCTCATCTTTTCCTGCTTAGAATTAACCATTGCTAATTTAAAGGAGCAGCTAGCT 

5461 GCTTTTCTGTCAGTCTGAAGCGTAGTAGTGGAAGAGGTAGTAAGCACCAGCTGCCTCTTT 

5521 GCTGCTTTGTTTTCCTCCTGATTCTCTTAAATTTGGGTTGCAAAGCTATCCCGCCCCCCA 

5581 CCCTGCCCCATGAAACTTGAGCATTCAAATGAAGATTCAGCAGTGTCTGTTCTTCATTTC 

5641 TATAGCCAAAGCTGTTAGTTAAAATCCCAAATCTATAGCATTTAAAGATACCAAATAGAA 

5701 ACACCTTCCAGCTTT 5715 



SEQ ID NO:2 

1 ML FTVS CS KMS S I VDRDD SSI FDGLVEEDDKDKAKRVS RNKS E KKRRDQFNVL I KELGSM 

61 LPGNARKMDKSTVLQKSIDFLRKHKEITAQSDASEIRQDWKPTFLSNEEFTQLMLEALDG 

121 FFLAIMTDGS 1 1 WSESVTSLLEHLPSDLVDQS1 FNFlPEGEHS'EVYKILSTKLLESDSL 

181 TPEYLKSKNQLEFCCHMLRGTIDPKEPSTYEYVKFIGNFKSLNSVSSSAHNGFEGTIQRT 

241 HRPSYEDRVCFVATVRLATPQFIKEMCTVEEPNEEFTSRHSLEWKFLFLDHRAPPIIGYL 

301 PFEVLGTSGYDYYHVDDLENLAKCHEHLMQYGKGKSCYYRFLTKGQQWIWLQTHYYITYH 

3 61 QWNSRPEFIVCTHTWSYAEVRAERRRELGIEESLPETAADKSQDSGSDNRINTVSLKEA 
421 LERFDHSPTPSASSRSSRKSSHTAVSDPSSTPTKIPTDTSTPPRQHLPAHEICMVQRRSSF 

4 81 SSQSINSQSVGSSLTQPVMSQATNLPIPQGMSQFQFSAQLGAMQHLKDQLEQRTRMIEAN 
541 IHRQQEELRKIQEQLQMVHGQGLQMFLQQSNPGLNFGSVQLSSGNSSNIQQLAPINMQGQ 
601 WPTNQIQSGMNTGHIGTTQHMIQQQTLQSTSTQSQQNVLSGHSQQTSLPSQTQSTLTAP 
661 LYNTMVISQPAAGSMVQIPSSMPQNSTQSAAVTTFTQDRQIRFSQGQQLVTKLVTAPVAC 
721 GAVMVPSTMLMGQWTAYPTFATQQQQSQTLSVTQQQQQQSSQEQQLTSVQQPSQAQLTQ 
7 81 PPQQFLQTSRLLHGNPSTQLILSAAFPLQQSTFPQSHHQQHQSQQQQQLSRHRTDSLPDP 
841 SKVQPQ 846 
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